Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Segmentation of Text and Graphics/Images Using the Gray-Level Histogram Fourier Transform

Identifieur interne : 001D13 ( Main/Exploration ); précédent : 001D12; suivant : 001D14

Segmentation of Text and Graphics/Images Using the Gray-Level Histogram Fourier Transform

Auteurs : A. Patricio [Espagne] ; D. Maravall [Espagne]

Source :

RBID : ISTEX:025E437C70CB7A833E07B7C71CEA5D2C40CA8CD9

Abstract

Abstract: One crucial issue in automatic document analysis is the discrimination between text and graphics/images. This paper presents a novel, robust method for the segmentation of text and graphics/images in digitized documents. This method is based on the representation of window-like portions of a document by means of their gray level histograms. Through empirical evidence it is shown that text and graphics/images regions have different gray level histograms. Unlike the usual approach for the characterization of histograms that is based on statistics parameters a novel approach is introduced. This approach works with the histogram Fourier transform since it possesses all the information contained in the histogram pattern. The next and logical step is to automatically select the most discriminant spectral components as far as the text and graphics/images segmentation goal is concerned. A fully automated procedure for the optimal selection of the discriminant features is also expounded. Finally, empirical results obtained for the text and graphics/images segmentation using a simple three-layer perceptron-like neural network are also discussed.

Url:
DOI: 10.1007/3-540-44522-6_78


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Segmentation of Text and Graphics/Images Using the Gray-Level Histogram Fourier Transform</title>
<author>
<name sortKey="Patricio, A" sort="Patricio, A" uniqKey="Patricio A" first="A." last="Patricio">A. Patricio</name>
</author>
<author>
<name sortKey="Maravall, D" sort="Maravall, D" uniqKey="Maravall D" first="D." last="Maravall">D. Maravall</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:025E437C70CB7A833E07B7C71CEA5D2C40CA8CD9</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1007/3-540-44522-6_78</idno>
<idno type="url">https://api.istex.fr/document/025E437C70CB7A833E07B7C71CEA5D2C40CA8CD9/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">002547</idno>
<idno type="wicri:Area/Istex/Curation">002379</idno>
<idno type="wicri:Area/Istex/Checkpoint">001316</idno>
<idno type="wicri:doubleKey">0302-9743:2000:Patricio A:segmentation:of:text</idno>
<idno type="wicri:Area/Main/Merge">001E13</idno>
<idno type="wicri:Area/Main/Curation">001D13</idno>
<idno type="wicri:Area/Main/Exploration">001D13</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Segmentation of Text and Graphics/Images Using the Gray-Level Histogram Fourier Transform</title>
<author>
<name sortKey="Patricio, A" sort="Patricio, A" uniqKey="Patricio A" first="A." last="Patricio">A. Patricio</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Espagne</country>
<wicri:regionArea>Centro de Cálculo Artificial, Universidad Politécnica de Madrid</wicri:regionArea>
<wicri:noRegion>Universidad Politécnica de Madrid</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Maravall, D" sort="Maravall, D" uniqKey="Maravall D" first="D." last="Maravall">D. Maravall</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Espagne</country>
<wicri:regionArea>Departamento de Inteligencia Artificial, Universidad Politécnica de Madrid</wicri:regionArea>
<wicri:noRegion>Universidad Politécnica de Madrid</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Espagne</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2000</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">025E437C70CB7A833E07B7C71CEA5D2C40CA8CD9</idno>
<idno type="DOI">10.1007/3-540-44522-6_78</idno>
<idno type="ChapterID">78</idno>
<idno type="ChapterID">Chap78</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: One crucial issue in automatic document analysis is the discrimination between text and graphics/images. This paper presents a novel, robust method for the segmentation of text and graphics/images in digitized documents. This method is based on the representation of window-like portions of a document by means of their gray level histograms. Through empirical evidence it is shown that text and graphics/images regions have different gray level histograms. Unlike the usual approach for the characterization of histograms that is based on statistics parameters a novel approach is introduced. This approach works with the histogram Fourier transform since it possesses all the information contained in the histogram pattern. The next and logical step is to automatically select the most discriminant spectral components as far as the text and graphics/images segmentation goal is concerned. A fully automated procedure for the optimal selection of the discriminant features is also expounded. Finally, empirical results obtained for the text and graphics/images segmentation using a simple three-layer perceptron-like neural network are also discussed.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Espagne</li>
</country>
</list>
<tree>
<country name="Espagne">
<noRegion>
<name sortKey="Patricio, A" sort="Patricio, A" uniqKey="Patricio A" first="A." last="Patricio">A. Patricio</name>
</noRegion>
<name sortKey="Maravall, D" sort="Maravall, D" uniqKey="Maravall D" first="D." last="Maravall">D. Maravall</name>
<name sortKey="Maravall, D" sort="Maravall, D" uniqKey="Maravall D" first="D." last="Maravall">D. Maravall</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001D13 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001D13 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:025E437C70CB7A833E07B7C71CEA5D2C40CA8CD9
   |texte=   Segmentation of Text and Graphics/Images Using the Gray-Level Histogram Fourier Transform
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024